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Figure 2. Transcription and splicing of mouse genes \sy Sxerevisiae. (A) /}//7 
bcus (Table 1, exp. 2). Lane 1, DNA marker; lane 2, FFEHll total yeast DNA 
PGR (genomic DNA control); lane 3, mouse cDNA PGR (spliced control)- lane 
4, FFEHl 1 random primed RT-PCR; lane 5, RT- control of lane 4; lane 6 the 
same as in lane 4, but RNA was degraded withRNase A prior RT (abhrevikted 
hereupon RN-); lane 7, unrelated YAC clone RT^>CR (henceforth YAC-)- 
lane 8, water controL The product in lane 4 was sequenced and found to 
correspond to the unspUced Dill gene. S. spUced; NS, not spliced product. 
(B) Sphcingofthe mouse gene by the yeast (exp. 8), stained agarose gel 
Lane 1, DNA maikei^ lane 2, genomic DNA control; lane 3, mouse cDNA 
sphced control; lane 4, FFEHl 1 random primed RT-^*CR; lane 5, RT- control; 
lane 6, RN- control. The product in lane 4 was sequenced and found to 
correspond to the correctly spliced transcript of the Psmhl gene. (C) Dl 7Ph4e 
locus is transcribed from both DNA strands (exp. 7). Lane 1, DNA marker; 
lane 2. genomic DNA control; lane 3, mouse cDNA spUced control; lane 4* 
FFEHU oligo(dT) primed RT-PCR; lane 5, RT- control; lane 6, FFEHll RT 
pruned with a sense oUgo (77B); lane 7, RT- control for lane 6; lane 8, FFEHl 1 
RT pruned with an antisense ohgo (5F); lane 9, RT- control for lane 8; lane 1 0 
RN- control; lane U , YAC- control; lane 1 2, water controL The prodiit in lane 
4 was sequenced and found to correspond to the unsphced part of the D 1 7Ph4e 
gene. (D) Autoradiograph of Southern btot of the agarose gel from (C) hybridized 
with a DiTPWe probe. 



the YAC RT-PCR sequence spans only 58 bp of the V inteigenic 
region of the Psmhl gene, and that the rodent Psmhl gene 
transcript carries two alternative polyA signals. The second 
mouse positioning signal (AATAAA) is surrounded by three 
hexanucleotides (two of them overlapping) similar to the yeast 
efficiency polyA signal (TATDTA). While both the yeast and 
mouse polyA signals are putative, detected by conqniter searcliing 
for consensus sequences downstream to polyA sites, the polyA 
sites were detected experimentally by cDNA sequencing. Neither 
mouse polyA signal is used by the yeast cell, as the 3' RACE 
sequence exceeds the length of any mouse Pjmjbi mRNA, and the 
putative yeast polyadenylation site is located downstream to two 
overlapping TATDTA yeast signals. 

DISCUSSION 

Previous studies on yeast and mammalian gene e?q)ression revealed 
common features as well as differences in their transcription, 
sphdng and polyadenylation mechanisms. It has been documented 
that yeast can recognize mammalian promoters (13-15), but yeast 
and mammals vary in tiieir requiiemaits for the sequence context of 
the TATA boxes (1 6). Also, the intron splicing signals show several 
difiamces in spite of the common GT-AG rule, shared zaoss the 
eukaiyotic kingdom. For example, the yeast 3' splicing signal 
consensus does not have a polypyrimidine stretdi preceding the AG 




Figure 3. Promiscuous transcription of mouse non-coding DNA in the yeast 
(Table 1, exp. 14). RT-PCR detects an RNA transcript fiom the intergenic 
region between the first exons of the 7hp and Psmhl genes: lane 1 DNA 
mar^; lane 2, genomic DNA control; lane 3, FFEHll random primed 
RT-PCR; lane 4, RT- control; lane 5, FFEHll oligo(dT) primed RT-PCR* 
bne 6, RT- control for lane 5; lane 7. RN- control; lane 8, YAC- control; lane 
9, water control (A) stained agarose gel; (B) autoradiograph of the blotted gel 
hybridized widi a sequenced probe from the Psmhl promoter region 
(C)RNase protection assay with an RNA probe for the intergenic region 
between the Tbp and Psmhl genes. Lane 1 , yeast RNA control (1 0 ug)- lane 2 
FFEHll RNA (10 Mg); lane 3, FFEHU RNA (35 Mg); lane 4. moui RNA 
U ^g); lane 5, no RNase; lane 6, elution control The arrows mark the sizes of 
the eluted (516 bp) and ftiUy protected (411 bp) RNA probe. 



sequence, and the branching site is more conserved in the yeast than 
in mammals (17,18). 

The mammahan polyadenylation signals are not generally 
recognized in S.cerevisiae (19,20). Although the yeast and 
mammahan signals are both composed of at least three elements, 
the yeast efficiency elements are diflferent and the other two 
elements are more degenerate than the mammalian signals 
(reviewed in 2 1 ). The yeast efficiency signal optimal sequence is 
TATATA and hexanucleotides TATDTA have some activity, but 
several non-optimal elements are usually located downstream to 
the polyA site (22). 

In spite of aU these differences. Still et al. (4) iqx)rted 
successful screening for human genes in yeast RNA from the 
yeast clcHies canying overtyping human YACs. Of 27 diiferentiaUy 
expressed RNA fragments tested, the expression of four clones in 
human tissues (15%) was proven by PGR screening of five human 
cDNA hbraries or by their match to expressed sequence tags. The 
specificity of the processing of the human genes by the yeast was 
not tested. 

In the present report, we took advantage of a well characterized 
mouse YAC clone (7,10,M) to determine how efficient and 
specific is the yeast processing of the mouse DNA. Transcripts 
fix)m five tested mouse genes encoded within the YAC clone were 
all found in the total yeast RNA. Of 12 mouse introns assayed, six 
were correctly sphced by the yeast. Besides the transcripts of exon 
sequaices, 'yeast-specific' transcription of the YAC DNA was 
observed At least three genes were transcribed fiom their sense and 
antisense strands. Microsatellite, inter-repetitive, and anonymous 
mouse loci were detected in random- and ohgo(dT)-primed YAC 
RNA. A pair of primers derived ftx)m the first exons of two 
head-to-head oriented mouse genes yielded an RT-PCR product. 
An RNA probe, derived fix)m this intergenic region, was wholly 
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protected by the YAC RNA in an RNase protection assay. This 
finding indicates that the steady state levels of transcribed RNA 
from the non-coding mouse sequences are high enough to be 
detected by a technique other than RT-PCR. The sequence 
analysis of a 3' RACE product has shown that, in agreement with 
expectation (19,20), the mouse polyadenylation signals are not 
used by the yeast cell. 

How to explain the observed high frequency of illegitimate 
transcription of mouse DNA in the yeast? The presence of 
urispUced introns, transcripts from both DNA strands or from 
microsateihte repeats could be easily understood, if genomic 
DNA rather than reverse transcribed cDNA acted as a template 
for PCR ampHfication. However, we have effectively ruled out 
this possibility not only by including the RNase-fiee DNase 
treatment and RT minus controls, but also by abrogation of the 
PCR signal by RNA treatment with DNase-free RNase before 
reverse transcription. Moreover, when the primers for intCTgcnic 
endogenous yeast region were used, the RT-PCR did not yield 
any signal. No products were obtained also when two yeast genes 
were amphfied from sense-oligo primed RT reactions. 

Yeast transcription initiation complex recognizes the TATA box 
consensus, but it is not sensitive to the TATA box sequence 
context that is preferred by the mammahan transcription complex 
(1 6). Thus, due to a high ratio of non-coding to coding DNA in 
mammals, some random mouse sequences could serve as 
promotes in the yeast and explain thepresoicc of transcripts from 
YAC nouKXxiing re^ons or a non-ten^late DNA strand. The same 
aigummt may hold true for TATA-less transa:q)tion initiation. 

The sphcing of six mouse introns (out of 12 tested) described 
in this report provides the first piece of evidence for a successfiil 
sphcing of mammahan introns by yeast. The failure to sphce all 
introns could be explained by the fact that the branching site 
sequence is more conserved in the yeast compared with 
mammals, and its variation can abohsh sphcing in the yeast (i 8). 
Admittedly, sphcing signals of four analyzed introns did not show 
any apparent differences that could distinguish three non-sphced 
introns from one that was sphced. None of the examined mouse 
introns displayed a branchir^ site identical to the yeast consensus. 
One possible explanation could be that these introns are sphced 
only partially and the non-sphced product was not detectable by 
our method. It may be of some interest that though the yeast 
introns are generally short, at least one of the successfully sphced 
mouse introns spanned >5 kb. Our study has suggested that 
although mammahan polyA signals arc not used by yeast, the YAC 
RNAs can be polyadenylated, apparently due to the redundant 
occurrence of the degenerate yeast polyA signal sequences. 

The presait r^rt shows that the yeast transcription ^aratus 
transcribes mouse coding and non^xxing sequences with con^ar- 
able efficiency The enrichment with mammahan mRNA in YAC 
RNA seems rather low and a method based on a selection of 
mammalian mRNA in the YAC clone RNA would be expected to 
produce a high bacl^und. For example, a hypothetical mouse 
gene of 2 5 and a mRNA of 2 kb in si2e would be transcribed and 
^liced by the yeast into an RNA >12 kb, provided that half of its 
introns were property sphced The auichment would be even lower 
if the gene was transcribed from a random sequence recognized by 
the yeast transcription ^paratus on a non-template DNA strand that 
cannot be sphced at all. The YAC clone mdsr study covers a 
GC-ridi region, as judged by the fiequaicy of rareKJUtter restriction 
sites (10). Such regions are gene-rich and under-represented in 
mammals (23). Although it cannot be a priori excluded that the 



YAC transcription and RNA processiiig would be more faithful in 
GC-poOT YAC clones, the expected occurrence of TAIA and 
TATDTA sequences, and thaeby of possible transcription initiation 
and RNA 3' end fonnation sites is higher. 

The resulte presented here indicate that YAC clones can serve 
as in vivo test tubes for further analysis of the conservation of gene 
processing sequences. An intrigumg possibihty emeiges to 
develop new yeast host strains capable of im>gnizing some 
mammahan DNA and/or RNA processing signals, and thus 
enriching RNA of YAC clones with mammahan exons. 
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